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Abstract 


In 1986, Van Rijsbergen suggested a model of an information retrieval system 
based on logic. We have advocated in earlier work that a logical approach 
should be based on a theory of information, Situation Theory, which provides a 
powerful range of concepts, and is useful for modelling documents and queries 
for the purpose of information retrieval. We also showed that Situation The- 
ory provides a framework to represent different types of information retrieval 
models, thus allowing speculation on their properties and their characteriza- 
tion language. This paper is an essay to convince the reader that Situation 
Theory presents many characteristics that are both adequate and appropriate 
for the modelling and the study of information retrieval. 


1 Introduction 


In 1986, Van Rijsbergen [37, 38, 39] suggested a model of an IR system based on logic because 
the use of an adequate logic provides all the necessary concepts to represent the different 
functions of an IR system. In a logical IR model!, the information content of a document is 
represented by a formula d and the information need, as phrased in the query, is represented 
by a formula g. The “truth” of d — q would mean that the query formula can be inferred from 
the document formula. To put it another way, the information captured by d is sufficient to 
infer the information represented by q. In the IR world, this could be viewed as the document 
satisfying (or being relevant? to) the query. 


We have advocated in earlier work that a logical approach should be based on a theory of 
information, Situation Theory {1, 13, 5], which provides a powerful range of concepts, and 
is useful for modelling documents and queries for the purpose of IR [28, 29, 40, 26]. We 
also showed that Situation Theory provides a framework for representing different types of IR 
models, thus allowing speculation on their properties and their characterization language [20, 
22, 19, 6, 23). 


This paper is an essay to convince the reader that Situation Theory presents many characteristics 


' A survey of the use of logic for IR modelling can be found in [27]. 
2 In this paper, the term “relevance” is used in both contexts, with respect to the user (which is the correct use in the IR world) 
and with respect to the system (indicating good satisfaction). 


11 


that are both adequate and appropriate for the modelling and the study of IR. Here, we 
concentrate on those we have experience with: 


« A situation theoretical IR model. 
¢ A situation theoretical framework for studying IR models. 


The organization of the paper is as follows. In section 2, the reasons for using Situation Theory 
as the theory to model an IR system are given. Section 3 focuses on the issue of Situation 
Theory as a meta-theory for IR. In the last section some conclusions and some guidelines for 
further research are presented. 


2 An Information Retrieval Model 
based on Situation Theory 


The use of logic to model IR seems to provide a more accurate model of information and its 
flow, which are principal concerns in IR. Indeed, in the Oxford English Dictionary, logic is 
defined as: 


“The branch of philosophy that treats of the form of thinking in general, and more 
especially of inference and scientific method.” 


That is, logic is a formalization of the way we use information in our everyday life to think, infer, 
conclude, acquire knowledge, make decisions and so forth. In this sense, logic undertakes to 
model information and its flow. In [40], we discussed the importance of the flow of information 
in IR. For example, in textual IR systems, the flow allows us to read (recognition of letters, 
words, and sentences); the flow allows us to understand what we are reading (semantics); and the 
flow allows us, with respect to our knowledge of the subject, to derive additional information 
from what we have read (pragmatics). 


The use of logic for modelling IR has been pursued by authors such as [32, 8, 35, 31], all of 
whom proposed interesting frameworks, although these authors have all adopted a truth-based 
logic, which corresponds to the second view of logic in the Oxford English Dictionary: 


“Also since the work of Frege (1848-1925), [logic is] a formal system using symbolic 
techniques and mathematical methods to establish truth-values in the physical sciences, 
in language, and in philosophical argument.” 


In many domains that relate to information, such as artificial intelligence, databases, linguistics 
and even philosophy, information is represented by some structure or calculus that is built on 
the concept of truth. We, on the other hand, following the line of [15, 30, 1, 13, 4, 3], and so 
forth, advance a logical model of IR using a logic of information and its flow. For this reason, 
we have chosen to use a theory, namely Situation Theory [1, 13], that is primarily concerned 
with the representation of information and its flow. 


In this section, we show the appropriateness of Situation Theory for modelling IR. We base our 
discussion on the work of Dretske [15], who provides a comprehensive dissertation about the 
role of information and its flow. Indeed, the philosophy behind the development of Situation 
Theory conforms to many of the points expressed by Dretske. Those relevant to IR are discussed 
in this section, and their representations within Situation Theory ontology are emphasized. The 
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main components of Situation Theory are given first. 


2.1 Infons, situations and types 


Situation Theory represents information without specifically indicating what information is. It 
considers information as a fundamental entity; from this, a model of the flow of information is 
derived. Situation Theory is concerned with information of the form: 


A property P holds / does not hold for the set of objects a1,...,@n- 


These two items of information are modelled by two infons, one being the dual of the other, 
respectively: 


< Piay,...,dn;1 >> and < P,ay,...,a,;0 > 


Situations are parts of the world from which information is extracted. Let o be an infon 
representing an item of information. If a situation s makes this information true, this is denoted 
s — o (read “support”). 


Types represent the uniformities that cut across infons. For example, the three following infons: 


< Weather, Glasgow, sunny; 1 >> 
< Weather, Windsor, sunny; 1 > 
< Weather, Algiers, sunny; 1 > 


have the common information that it is sunny. What differs in these infons is the city. The 
type abstracting among these infons can be defined as: 


yp = [s|s E< Weather, é, sunny; 1 >] 


which is the type of any situation about a city (represented in the type by the parameter c) 
where the sun is shining. If s is one of them, this is written s K ¢°. 


2.2 Signal 


Dretske [15] explains that information is knowledge about a source; it is communicated by a 
signal to a receiver. In IR, the source is the document and the receiver is anybody observing 
the document (reading a text, listening to an audiotape or observing an image). Signals are 
whatever means by which the information about a source is delivered to the receiver. For 
example, if the document is a text, the signal is a mixture of the reader’s vision capability, his 
or her understanding of the information read, and his or her general knowledge about its subject. 
A signal can also be the indexing process which delivers a representation of the information 
content of the document. 


Dretske [15] defines a source as any structure with an information content; situation is of 
primary concern here. Let d be a situation and ¢ be an information item. If the signal carries 
the information that d contains ¢y (or, as expressed by Drestke, d is y), this is written in 
Situation Theory d F ¢. 


3 In [13], a detailed description of infons and situations, together with a set of rules that ensure proper instantiation (called 
anchoring) of parameters, is given. The understanding of these rules goes beyond the scope of this paper, so they are not 
discussed further. 
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A signal which carries d — ¢ often carries additional information about the situation d or other 
situations owing to the very fact that d supports y. This information is said to be nested into 
the fact that d supports y. 


Dretske notes that information can be encoded in two forms: a signal carries the information 
dF ¢ in digital form if, and only if, this information is not nested in any other information 
carried by that signal; otherwise, the signal carries this information in analog form. 


2.3 Perception 


The amount of information contained in documents is usually very large. One could compare 
a (physical) document’s information content to a sensory experience, which often embodies a 
great variety of details. According to Dretske [15], the perception of a sensory experience is 
the process by which information is delivered to a cognitive agent for its selective use. It is 
identified with a signal that carries information about a source which is coded in analog form. 
Until information has been extracted from this signal, nothing corresponding to recognition, 
classification or identification has occurred. It is the successful conversion of information into 
the appropriate digital form that constitutes the essence of cognitive activity. In Situation 
Theory, situations are the objects of perception. They provide the information that signals carry 
in analog form. 


A perception process often embodies information about a variety of details that, if carried 
over in total to the cognitive agent, would require immense storage and retrieval capabilities. 
Moreover, there is more information than can be extracted and/or exploited by the cognitive 
agent. Only some of the information the perception process carries in analog form is retained. 
The same holds true for most (if not all) IR systems. The indexing of a document does not 
give an exhaustive description of the information content of that document. There would be 
too much information to store, and sometimes it is not even possible to exhaustively determine 
the information content. 


A perception process is determined not by what information is carried, but by the way it is 
carried. Seeing, hearing or reading are not different processes because of the information they 
carry (the information might be the same), but because of the vehicle by which this information 
is delivered. Two different concepts are involved here: how the information is delivered and 
what the information represents. Situation Theory is concerned with the latter, for a situation 
can be a text, an image or a speech. Therefore, a model based on Situation Theory could 
eventually incorporate multimedia IR systems. 


2.4 Cognition 


Dretske [15] describes cognition as the conversion of the information a cognitive agent receives 
in analog form into digital form. The resulting information is usually qualified as a knowledge 
with respect to the cognitive agent. The conversion, referred to by Dretske as digitalization, 
involves a loss of information because it turns a structure of greater information content into 
one of lesser information content. What is gained by this loss is some sort of classification 
of the information provided. 


The indexing process in IR can be compared to a digitalization process. The IR system is the 
cognitive agent. The document is a situation that contains information in analog form. The 
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information which is (successfully) digitalized constitutes the document representation. The 
goal is to minimize the loss of information involved in the conversion while at the same time 
obtaining a small enough document representation for both storage capacity and retrieval speed. 
The outcome of the cognition process is often a partial representation of the information carried 
about a source, which is a document in IR. 


Some researchers [30, 1] refer to a situation as a partial object, which can contain a vast amount 
of information, though only part of it is digitalized. Moreover, whether an item of information 
is to be digitalized or not depends on two properties attached to the cognitive agent: 


* its capability of perception. For example, a human being and a robot do not perceive 
information at the same level. A robot can identify entities that a human being cannot, 
and vice versa. 


* its focus of attention, because cognitive agents are often constructed to fulfill a task. For 
example, the color of a wall may be of no interest to a moving device whose purpose 
is to avoid the wall. 


The essence of Situation Theory is to capture these facts, which is often not the case with many 
other frameworks such as those based on truth (e.g., Classical Logic, Default Theory [33], 
Modal Logic [18]). In these, every representation of an information item is assessed either 
to belong or not to belong to the document (the assessment is often a truth value). This is 
unreasonable because the fact that an item of information is not initially present in a document 
should not imply that this item of information is not part of that document’s information content. 
This information may be implicit in the document. 


A document can be identified as a situation since a document contains the information written in 
its text, or contained in its images. In text documents, the digital information is the information 
that is made explicit by the indexing process. This information is strongly dependent of the 
properties on the indexer (the cognitive agent). 


2.5 Information vs. meaning 


Dretske [15] claims that information and meaning are two different concepts. Indeed, there is 
no reason to assume that the information a signal carries is identical to its meaning. Often, the 
information contained in a signal exceeds its meaning. For example, the statement “Keith is at 
home” means that Keith is indeed at home. It does not mean that “Keith is at home and not at 
work”, though Keith being at home implies that Keith is at home and not at work. A signal that 
carries “Keith is at home” also carries “Keith is at home and not at work”. This information 
follows from the flow of information that springs from the meaning attached to the statement. 
This difference between information and meaning is highlighted in the quote ([15]): 


“cc 


information is that commodity of yielding knowledge, and what information a 
signal carries is what we can learn from it”. 


In IR, understanding the meaning attached to the sentences of a document is important, but 
is insufficient for determining the information content of the document. A more thorough 
representation of a document’s information content needs the appropriate capturing of the flow 
of information. This is why frameworks such as Montague Semantics [14] are not appropriate, 
since they are theories of meaning, whereas Situation Theory is a theory of information. 
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2.6 Flow of information 


During the indexing process in IR, the information that is digitalized constitutes the document’s 
initial representation, which is not an exhaustive description of the information content of 
that document. This representation customarily depicts the explicit information content of the 
document, which is determined at most from the meaning of the sentences of the document. 
Additional information can often be identified as part of the document information content. 
This additional information constitutes the implicit information content of the document. It is 
carried in analog form and remains so until extracted (or digitalized) by the indexing process. 
After this, it is carried in digital form, that is, as part of the (known) information content of the 
document. The existence of this information is due to the flow of information that arises from 
the explicit information content of the document. 


It may help in understanding this notion of flow of information to consider some examples. A 
situation showing smoke contains information that there is a fire because of the constraint that 
smoke usually means fire. Squareness for an object contains the information that the object is 
a rectangle relative to an analytic constraint. A document about Pascal contains information 
about programming languages. 


We can see that the flow of information characterizes information containment and can generally 
be defined as the information an object contains or carries about itself or another object. The 
purpose of an IR system is to provide information about a query, which is a representation of an 
information need that an IR system attempts to satisfy. Hence, determining relevance consists 
of computing the information contained in one object (e.g., a document) about another (e.g., 
a query). This problem is the identification of a flow of information between the document 
object and the query object. 


Information containment is usually described by (informative) relationships between items of 
information and the object affected by this information containment. Let there be a relationship 
between the two information items p and p’ (e.g., synonymity). The flow of information based 
on this relationship indicates that an object which contains the information item p contains or 
carries the information that itself or a second object contains the information item p’. 


In Situation Theory, constraints model relationships which are defined between types. Let ¢ 
and y be two types that constitute the constraint ¢ — y. The application of this constraint to a 
situation s is possible if first s; FF ¢ and then informs of the existence of a situation s2 such that 
sq FF y: the fact s; F ¢ carries the information that sz F y. A flow of information circulates 
between the situations s; and s2, and the nature of the flow is defined by the constraint ¢ > y. 


If s; is the document object and s2 is the query object, then the existence of the flow that 
circulates between s; and s2 means that the document is relevant to the query. The nature 
of the flow is determined by the constraints, which, in IR, can model any thesaural, semantic 
relationships (e.g., “Holland” and “The Netherlands” are synonymous) or pragmatic relationships 
(e.g., the systematic relationship that most people attach to “wine” and “France”), or more 
complex relationships like those handled by artificial intelligence. 


2.7 Uncertain flow of information 


The flow of information can be uncertain. For example, consider the synonymous relationship 
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between the two terms ¢ and t’ as the basis of a flow. Often, two synonymous terms have 
different meanings in certain contexts. If it is not known which sense the term t refers to in a 
given context, then the flow that relates this term to t’ is uncertain. The relationship between t 
and t’ might not be appropriate with respect to that context. 


In Situation Theory, uncertain flows come from the fact that constraints do not always hold. The 
constraints that do not always hold are called conditional. A conditional constraint is written 
¢ — y | B, which highlights the fact that the constraint ¢ — y holds for a given situation if 
the background conditions captured within B are satisfied by that situation: 


¢ if the situation satisfies the background conditions, the use of the constraint is certain, 
as well as the resulting flow. 


* if the background conditions are not satisfied, the constraint cannot be used. 


* if it is not known whether the background conditions are satisfied by the situation, the 
use of the constraint is uncertain, and the resulting flow is uncertain. 


The use of background conditions in an IR model acknowledges the important fact that 
information is seen to be dependent on a context. For example, background conditions can 
represent context with respect to polysemic words. Consider the word “bank” in a document 
dealing with finance. Inference with respect to that word should relate to the “money bank” 
context, and not “river bank”. 


The background conditions can be particularly complex to identify. In everyday reasoning, 
people often use background conditions, though they are not aware of them. People often, 
if asked, cannot express them. This should not imply the non-existence of the background 
conditions. As Devlin points out [13], background conditions become a concern only when 
a constraint fails. 


Similar problems arise in IR. Take the example of a synonym-based constraint. If the term 
referred to in this constraint is polysemic, disambiguation is necessary to ensure that the 
constraint can be used. In IR, disambiguation unfortunately is not always successful or even 
possible (34, 41]. This implies that the use of such a constraint will often be uncertain, thus 
leading to an uncertain flow. The more uncertain the flow, the less relevant the document. 


In IR, it is usually insufficient to say that a document is relevant to a query; knowing the extent 
of relevance is necessary. In [25], we use the uncertainty engendered by uncertain flow as 
the basis of a numerical formulation of relevance. We achieve this by mapping a theory of 
uncertainty on top of Situation Theory. However, by using Situation Theory to model IR, the 
uncertain nature of the flow of information is already qualitatively captured, and can then be 
initially studied on its own. 


2.8 Natural Language Processing 


The use of Situation Theory provides an additional benefit because it has been used to develop 
a framework for natural language processing. This framework is called Situation Semantics [5]. 
It models the utterance of a sentence with three entities: the type that represents the information 
content of the sentence, the situation that the sentence describes, and the situation in which the 
sentence is uttered. All the components of a sentence are defined in terms of these three types 
of entities, which are combined to form the three entities of the sentence. A model based on 
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Situation Theory can use Situation Semantics as the natural language process to identify the 
types that are supported by the situation modelling the document. 


3. A Meta-Theory for Information Retrieval 
based on Situation Theory 


Having at our disposal a “meta-theory” for IR is becoming a necessity for several reasons. First, 
the effectiveness of IR systems, up to now, has been studied and compared through recall and 
precision values. We admit the great usefulness of these statistics; however, for more suitable 
statements to be made about the qualities of one model over another, we need a more formal 
means of study and comparison. 


Second, many research outcomes have indicated that significant improvement in the effective- 
ness of IR systems based on some models is difficult to obtain, but no formal proof of this fact 
can be given. A formal framework is required to carry out such proofs. 


Third, an IR model can be seen as a solution proposal for the problem of retrieving information 
that is likely to be relevant to an information need. Indeed, every IR model proposes a theory, 
sometimes ad hoc, that defines the notion of relevance. For instance, in the probabilistic models, 
the notion of relevance is explicitly defined in terms of probabilistic laws. One of them is that 
a document d is relevant to an information need q iff it holds that Pr(d | q) > Pr(d). The 
underlying theory of each IR model must be appropriately elicited if IR models are to be 
studied and compared. 


Finally, the underlying theory of an IR model is, often, based on some explicit or implicit 
assumptions, for reasons of simplicity and scalability. For example, many probabilistic models 
are based on some version of independence [12]. Representing such assumptions formally is 
essential for better understanding of the behavior of any IR model. For example, in some 
applications, some assumptions may be preferred to others, and as a result, some IR models 
should be preferred to others. 


To summarize, developing a meta-theory for IR is vital because it is the only way to answer, 
among others, the following questions: 


* Which model is preferable (in a particular case)? 


* Why are some models, when implemented, unable to make a significant improvement? 
Is there a theoretical justification for this? 


¢ What theories lie behind the retrieval mechanism? 
* What are the assumptions underlying an IR model? 


In this section, we show that Situation Theory offers enough expressiveness to be a suitable 
candidate to develop a meta-theory for studying IR. 


3.1 Situation Theory 


In a logical IR model, a document d is relevant to a query q if the document logically satisfies 
the query; in other words, if the document d is about the query g. What the different IR models 
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propose is a way to represent this aboutness relationship between documents and queries. We 
propose to use this notion of aboutness in an abstract way to study IR models. For this purpose, 
our meta-theory is defined to offer the possibility of axiomatizing the aboutness relationship 
of every IR model. 


As mentioned in Section 2, IR models are developed around the concept of information. 
Consequently, for a study of IR models we must go beyond the simplistic, often adopted 
approaches of internal representations of the documents and queries (e.g., a set of keywords, a 
Boolean expression). For instance, if the theory cannot express relations between information 
items, IR models with a relational language (e.g., Farradane’s relation indexing [16, 17]) are 
excluded. 


We use Situation Theory to develop our meta-model because we see the aboutness relationships 
as a flow of information between two situations: 


A document is about a query if the situation representing the document contains 
information about the situation representing the query. 


As different IR models propose different kinds of flows, let us first inspect the basic components 
of all IR models. The first component is the characterization language which digitalizes the 
information content of a document or the information need expressed in a query. For example, 
the majority of characterization languages are based on keyword representation; there, items of 
informations are keywords extracted from the document or the query. Following Devlin [13], 
we view this item of information as an infon. 


We represent the characterization language as a function x that takes an object (a text, an 
image, an information need) and returns a representation of the information content of that 
object. In the remainder of this paper, we refer to y(d) and x(q) as representing, respectively, 
the information content of the document d and the query qg (here, an information need). 


The second main component of all IR models is the matching function which determines whether 
a document is relevant to (is about) a query. The matching function is usually defined with 
respect to x(d) and x(q). 


To create a platform to make aboutness decisions, we have to make some explicit assumptions 
concerning aboutness. The first of these is that aboutness can be derived with some sort of logic. 


3.2 Derivation systems 


We need a general definition of the aboutness relationship to model the relevance of any IR 
models. For example, in (classical) logical models, the relationships could be between models 
and formulas; in that case, a document is about a query if the model representing the document 
logically satisfies the formula representing the query (i.e., x(d) F x(q)). In the so-called 
overlap models‘, the aboutness relationship is between sets of keywords, and a document is 
about a query if the intersection between the set of keywords representing the document and 
that representing the query is non-empty (i.e., x(d) N x(q) # 9). 


Formally, we represent the aboutness relationship as a flow of information between two 
situations: 


* These models are based on matching functions such as Dice or Jaccard‘s ones. There, the characterization language returns 
sets of keywords for both documents and queries. 
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Given two situations S and T', we write SO~ T to state that the situation S is about 
the situation T’, and we write SO T to say that S is not about T?, 


The infons supported by these situations are those obtained from the characterization language 
(e.g., a set of keywords, a Boolean expression, a vector, etc.). 


Whether a situation S is about a situation T (i.e., whether SO-~» T or SO T) may not 
always be possible to determine. However, this could be derived knowing the properties (if 
any) associated with the aboutness relationship. Therefore, we suggest that aboutness decisions 
can be determined with some sort of logical derivation. For instance, consider the union of 
a situation S with another situation S’; can we decide (derive) that SU S’O ~ T for any 
arbitrary situation T such that SO ~ T? An IR model satisfying such a rule allows the 
following relevance (aboutness) decision: 


A document on “air pollution” is relevant to a query requesting information on “water 
pollution” knowing that a document on “pollution” was relevant to that query. 


These logical derivations play an important role in IR (as well as in Situation Theory [4]). 
Making these types of derivation explicit is the underlying idea of our work. We accomplished 
this by defining a set of axioms and rules that hold for a particular aboutness relationship. This 
is done by the so-called derivation system: 


A derivation system A is a pair of the form (Az, Rule), with Az a set of axioms and 
Rule a set of rules of the form R(T1,...,7%, 7,41). Here, Ti,...,7j, are the premises 
of the rule and 7,41 is the conclusion. 


Note that we do not make any statement about the axioms or the rules of the derivation system. 
These depend on the IR model itself. For example, they could be analogous to that in classical 
logic. In that case, an aboutness relationship is derivable from another one if there is a range 
of ‘intermediate’ aboutness decisions, which are either axioms of classical logic or arise from 
previous aboutness relationships by application of a rule of classical logic. Another example 
is to consider a default theory (see (33]) as a derivation system. In that case, the derivation 
relation is defined as being an element of an extension. This derivation relation is in general 
not expressible by means of the classical logical derivation relation. Generally, this is the case 
for non-monotonic derivation relations. 


The derivation system, as defined above, results in a sufficiently abstract framework onto which 
an arbitrary retrieval mechanism can be mapped, and then studied and compared. 


3.3. Axioms and rules 


We can propose axioms that are satisfied by the aboutness relationship of some IR models. 
For instance, the Situation Theory model, as proposed in section 2, is based on the reflexivity 
axiom; that is, every situation is about itself (i.e, SO ~+ S). 


Another axiom, governed by many logical approaches and not by overlap models, is the 
transitivity rule (also known as the Xerox Principle [15]): 


Given that SO~ T and TO~ U we can conclude that S is about U, that is, SO~ VU. 


5 In the recent work pursued by Barwise [2] and Seligman [36], this explicit link between situations is formally represented by 
channels. 
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This rule is valid when the aboutness derivation is founded on Boolean logic. However, in the 
overlap models, stating that there is an overlap of information items between sets S and T’ and 
that there is an overlap between sets T and U does not imply that there is an overlap between 
sets S and U. The overlap aboutness relationship is not transitive [20]. 


3.4 Properties of IR models 


Our meta-model offers the possibility of comparing IR systems according to which axioms 
and rules they satisfy. Consider strict coordinate level matching, where d is about q if all 
the keywords used to express the query are part of those used to represent the document’s 
information content (i.e., x(q) C x(d)). Also, consider the Boolean retrieval model, where d 
is about q if the formula representing the document logically implies the formula representing 
the query (i.e., y(d)  x(q)) in a classical derivation system extended with the closed world 
assumption. We showed in [20] that the axioms of the derivation system modelling the strict 
coordinate level matching are a subset of those defining the derivation system of the Boolean 
retrieval model. Therefore, we could prove that the recall of a Boolean retrieval system is 
always higher than or equal to that of a strict coordinate level matching system (see [20, 10] 
for the proof). 


With our meta-model, we can also elicit basic properties of the derivation systems associated 
with an IR model. One important property, governed by some models and rejected by others, 
is non-monotonicity. In the Artificial Intelligence field, the non-monotonic behavior of rules 
(in their terminology, known as non-monotonic reasoning) plays an important role. A rather 
informal definition proposed by Lukaszewicz [7] clearly presents the idea behind this sort of rule: 


By non-monotonic reasoning we understand the drawing of conclusions which may be 
invalidated in the light of new information. A logical system is called non-monotonic 
iff its provability relation violates the property of monotonicity. 


In terms of IR, assume a document d which is about a query q. Now, if there is additional 
information about d (for example, derived from a thesaurus), the question arises whether in 
this model d is still about g. Some authors have argued that IR models should have this 
non-monotonic character [24, 9, 11]. 


4 Conclusions and Further Research 


From Dretske’s account of the role of a theory of information [15], Situation Theory seems 
the right framework for the qualitative modelling of the IR system. Situations and types show 
similarities with documents and their information content. Supported information corresponds 
to the explicit information content (digitalized) of the document, whereas carried information 
corresponds to its implicit information content. Constraints are the perfect tool to represent 
thesaural, or any semantic or pragmatic relationships. 


We have also presented a meta-theory for IR based on Situation Theory. There, a formal 
representation of a document’s information content and an information need have to be outlined. 
By proposing a set of axioms, implicit assumptions governing an IR mechanism can be brought 
to light. The effectiveness of a retrieval mechanism can be examined, not only by running 
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experiments, but inspecting the axioms of the model, and the properties and consequences of 
the axioms. The meta-theory has been used since to axiomatize several IR models (see for 
example [6, 21, 19, 23]). 


The use of Situation Theory to model an IR system or to develop a meta-model of an IR system 
led us to a better understanding of the nature of information in IR. This nature is multiple 
and can be studied in different perspectives. For example, in future research, we will look 
at the nature of information for user modelling. Indeed, a correct representation of the user’s 
(mental) intention certainly generates better retrieval. The attainment of such a representation 
enters the area of Cognitive Science, some aspects of which can be formally expressed with 
Situation Theory [13]. 


It is also interesting to note the use of a meta-model for studying uncertainty aspects in IR. 
Applying the method as proposed in [21] in our framework, it is possible to model relevance 
degrees at the deduction level of the underlying logic, in order to have them explicitly rest 
on preferential grounds. This is based on the claim that user preferences for documents are 
based on preferences among aboutness properties: while making his or her choice, the user 
takes for granted certain assumptions concerning aboutness, and certain preferences concerning 
these assumptions. 


Another issue which we will investigate is the representation of contexts, which recently has 
reached higher prominence in IR. For example, with networked IR (or any distributed database), 
which is increasingly widespread, it is necessary to represent the fact that retrieval is with respect 
to a site (a context). Moreover, two sites (contexts) may be involved in a retrieval process and 
the information retrieved from them must be aggregated. Situation Theory captures this notion 
of context with situations and background conditions. 
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